Ensemble Learning of Economic Taxonomy Relations from Modern Greek Corpora
نویسنده
چکیده
This paper proposes the use of ensemble learning for the identification of taxonomic relations between Modern Greek economic terms. Unlike previous approaches, apart from is-a and part-of relations, the present work deals also with relation types that are characteristic of the economic domain. Semantic and syntactic information governing the term pairs is encoded in a novel feature-vector representation. Ensemble learning helps overcome the problem of performance instability and leads to more accurate
منابع مشابه
Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text
This paper describes Eksairesis, a system for learning economic domain knowledge automatically from Modern Greek text. The knowledge is in the form of economic terms and the semantic relations that govern them. The entire process in based on the use of minimal language-dependent tools, no external linguistic resources, and merely free, unstructured text. The methodology is thereby easily portab...
متن کاملLearning Subcategorization Frames from Corpora: a Case Study for Modern Greek
Certain Natural Language Processing (NLP) applications such as parsing and semantic processing require complete lexicons that provide subcategorization information for a word of interest, i.e. the necessary information about the set(s) of syntactic constituents the word must combine with, in order for its meaning to be fully expressed. Modern Greek presents high flexibility in the allowable ord...
متن کاملEnsemble Learning for Low Resources Prepositional Phrase Attachment
Prepositional phrase attachment is a major disambiguation problem when it’s about parsing natural language, for many languages. In this paper a low resources policy is proposed using supervised machine learning algorithms in order to resolve the disambiguation problem of prepositional phrase attachment in Modern Greek. It is a first attempt to resolve prepositional phrase attachment in Modern G...
متن کاملA Short Survey on Taxonomy Learning from Text Corpora: Issues, Resources and Recent Advances
A taxonomy is a semantic hierarchy, consisting of concepts linked by is-a relations. While a large number of taxonomies have been constructed from human-compiled resources (e.g., Wikipedia), learning taxonomies from text corpora has received a growing interest and is essential for longtailed and domain-specific knowledge acquisition. In this paper, we overview recent advances on taxonomy constr...
متن کاملDELOS: An Automatically Tagged Economic Corpus for Modern Greek
Text corpora resources have become an essential tool for Natural Language Processing tasks over the past years. A wide range of applications like information retrieval, ontology and terminology extraction require a sufficiently large corpus but of restricted domain. Manual tagging of such a corpus is very costly, making automatic annotation by a set of linguistic tools a very challenging idea. ...
متن کامل